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^O ' Abstract 

We describe a novel extension of subspace codes for noncoherent networks, suitable 
for use when the network is viewed as a communication system that introduces both 
5/5 ' dimension and symbol errors. We show that when symbol erasures occur in a signif- 

icantly large number of different basis vectors transmitted through the network and 
when the min-cut of the networks is much smaller then the length of the transmitted 
codewords, the new family of codes outperforms their subspace code counterparts. 

For the proposed coding scheme, termed hybrid network coding, we derive two 
upper bounds on the size of the codes. These bounds represent a variation of the 



in 
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Singleton and of the sphere-packing bound. We show that a simple concatenated 
scheme that represents a combination of subspace codes and Reed-Solomon codes is 
asymptotically optimal with respect to the Singleton bound. Finally, we describe two 
efficient decoding algorithms for concatenated subspace codes that in certain cases have 
smaller complexity than subspace decoders. 



x. 

1 Introduction 

Network coding is a scheme introduced by Ahlswede et al. [1] for efficient communication 
over networks with transmission bottlenecks. The authors of pQ showed that under broadcast 
scenario in networks, the maximal theoretically achievable communication rate - called the 
capacity of the network - can be characterized by minimal cuts in the network and achieved 
by appropriate coding methods. 

In the last decade, network coding became a focal point of research in coding theory. 
There exists a variety of network coding solutions currently used in practice. A random 
network coding approach was first proposed in [9j, while algebraic coding was shown to 
achieve the capacity of a class of networks in [T5| |T%] . Non-linear approaches were also 
studied in [3]. The use of network coding for error correction was first proposed in [2]. When 



the network topology is not known, or when it changes with time, it was suggested in [13 
to use subspace coding for joint error correcting and network coding. 

In our work, we follow the line of research started in [13]. More specifically, we consider 
error-correction for a special case of network coding, suitable for practical applications in 
which the topology of the network in not known or changes with time. This type of scheme 
is, as already mentioned, known as coding for noncoherent networks. Currently, the only 
known approach for noncoherent network coding utilizes subspace codes. 

Subspace codes for noncoherent network coding are based on the idea that the transmitted 
data vectors can be associated with linear vector subspaces. Linear network coding does not 
change the information about the subspaces, since it only allows for linear combining of the 
transmitted bases vectors. Hence, if there are no errors, the receiver obtains uncompromised 
information regarding the transmitted subspace. The transmitted subspaces can only be 
modified within the network through the introduction of errors. In order to protect the 
transmitted information in the latter case, one has to add carefully structured redundancy 
into the subspace messages. 

In the context of the work [T3], the errors are modeled as dimension gains and dimension 
losses. These notions, although of theoretical value, may appear rather abstract in real 
networking applications, where packets (symbols or collections of symbols) are subjected to 
erasures or substitution errors. One fundamental question remains: how is one to interpret 
the notion of dimension gains and losses in terms of symbol errors and erasures, and what 
kind of errors and erasures constitute dimension gains and losses? 

We propose a hybrid approach to noncoherent network coding, which attempts to connect 
the notions of dimension loss and gain with those of individual symbol errors and erasures. 
The crux of our approach is to consider network coding where dimension gains and losses, in 
addition to individual symbol errors and erasures, are all possible. This allows us to study the 
trade-offs between the required overhead in the network layer aimed at correcting dimension 
gains/losses, and the overhead in the physical layer designated to correcting symbol erasures 
and errors. 

Our main result shows that by incorporating symbol error correcting mechanism into 
subspace codes, one can increase the number of tolerable dimension gains and losses, without 
compromising the network throughput. Hence, the proposed approach leads to an increase 
in the overall number of correctable errors in the subspace-based scheme akin to [13] . 

In order to illustrate our approach, consider the following straightforward example. As- 
sume the case of a noncoherently coded network in which arbitrary (unknown) ten symbols 
are erased from the basis vectors representing the message. The first question is how many 
dimension losses should be considered in the model of [13]? One reasonable way to look at 
it is to assume the worst-case scenario when each symbol erasure introduces one dimension 
loss, and each error introduces a simultaneous dimension loss and gain. Consequently, ten 
symbol erasures would amount to ten dimension losses. However, if there were an alternative 



way to correct some of these symbol erasures or errors, the effective number of dimension 
losses and gains may become significantly smaller. In the example, correcting five symbol 
erasures would, in the best case, reduce the burden of subspace codes in terms of dimension 
loss recovery by five dimensions. 

We therefore pose the following questions: what are the fundamental performance bounds 
for noncoherent network coding schemes, consequently termed hybrid network codes, capable 
of correcting symbol erasures and errors on one side, and dimension gains and losses on the 
other side? What is the optimal rate allocation scheme for hybrid network codes in terms of 
dimension losses/gains and symbol errors/erasures? What is the optimal ratio between the 
two allocation rates and how can it be achieved practically? How does one efficiently correct 
errors in this new scheme? The work in this paper is aimed at answering these questions. 

There are various potential applications for the proposed network codes. The hybrid 
codes can be useful in networks, where no link-layer error correction is performed. Such 
networks include sensor networks, where the computational power of the intermediate nodes 
is not sufficient to perform error correction at every such node. The hybrid codes can also 
be used in the networks, where the size of a physical layer packet is very small, and where 
the network layer packet consists of many physical layer packets. Then, any physical layer 
packet can be regarded as a single symbol. 

The paper is organized as follows. The notation and prior work are discussed in Sec- 
tion |2j In the sections that follow, we define hybrid codes that can be used for simultaneous 
correction of dimension losses/gains and symbol erasures in noncoherent networks. More 
specifically, the basic code requirements and parameters are presented in Section El Two up- 
per bounds on the size of hybrid codes, the Singleton bound and the sphere-packing bound, 
are presented in Section HI A straightforward code construction appears in Section 15.11 The 
analysis of code parameters and the comparison with known subspace code constructions 
also appear in the same section. The decoding algorithm for the proposed codes is presented 
in |H1 In the Appendix we show that the same codes can also be used for simultaneous cor- 
rection of dimension losses and symbol erasures/errors. We state some results analogous to 
those in Sections I3HEJ Finally, we discuss some results related to simultaneous correction of 
both dimension losses/gains and symbol erasures/errors. 



2 Notation and Prior Work 

Let W be a vector space over a finite field ¥ q and let V, U C W be linear subspaces of W. 
We use the notation dim(V) for the dimension of V. We denote the sum of two subspaces 
U and V as £7 + V = {u + v : u eU,v G V}. If U D V = 0, then for any w G U + V there 
is a unique representation in terms of the sum of two vectors w = u + v, where u G £7 and 
v G V. In this case we say that U + V is a direct sum, and denote it by U ® V. It is easy 
to check that dim(U ®V) = dim(U) + dim(F). 



Let W — U' © U". For V CW we define a projection of V onto [/', denoted by V\u>, as 
follows: 

V\u> = {ui : ui + u 2 e V, ui e U', u 2 e U"} . 

Similarly, we denote the projection of the vector u onto U' by (u)\u'- For two vectors, u 
and v, we write it • v to denote their scalar product. If W — U' © U" and for all u £ U', 
v G Z7" it holds u ■ v = (i.e. £/' and [/" are orthogonal), we also write W = U' Q ^' ; - 

For a set of vectors S 1 C iy, we use (5) to denote the linear span of the vectors in S. We 
also use the notation (ui, n 2 , • • • , ui) for a vector span of the set of vectors {iti, u 2 , • • • , ui}. 
Let N be the set of the positive integer numbers. We write m to denote the all-zero vector 
of length m, for any m G N. When the value of m is clear from the context, we sometimes 
write rather than m . We also denote by e, = (0, . . . , 0, 1, 0, . . . , 0) G F™ a unity vector 

i— 1 n—i 

which has a one in position i G N and zeros in all other positions. The length of the vector 
will be clear from the context. 

Assume that dim(W / ) = n. We use the notation V(W, £) for the set of all subspaces of W 
of dimension £, and V(W) for the set of all subspaces of W of any dimension. The number of 
^-dimensional subspaces of W, < £ < n, is given by the g-ary Gaussian coefficient (see 
Chapter 24]): 



\V(W, 
For U,VeW, 



n 



£-1 

; i n n-i i 



i=0^ 



D(U, V) = dim(U) + dim(V) - 2 dim(U n V) 

be a distance measure between [/ and V in the Grassmanian metric (see [13]). We use the 
notation d(n, v) for the Hamming distance between two vectors u and v of the same length. 

We say that the code C is an [n, £, log q (M),D] q subspace code, if it is represents a col- 
lection of subspaces in an ambient space W over ¥ q , which satisfies the following conditions: 

1. W is a vector space over ¥ q and dim(jy) = n; 

2. for all V G C, dim(V") = £; 

3. |C| =M; 

4. for all C/, V G C, C/ ^ V, it holds that dim(C/ D V) < £ - D, so that consequently 
D(t/, V) > 2D. 

A linearized polynomial is a polynomial of the form 

fc-i 



L(x) = ^L 
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where all Li G F. It can be easily checked that linearized polynomial is a linear transforma- 
tion from a vector space associated with F into itself. 

Given these definitions, we recall the construction in [13J. Let F = ¥ qm be an extension 
field of W q , m > 1. Then, F is a vector space over ¥ q . Let {«i, 0=2, • • • , o^} C F be a set of 
£ linearly independent elements in F, and let 

A = (0:1,0:2, • •• ,OC£) , 

and 

^ = A©F = {(t)i,«2):t)i6A,t)2eF}. (1) 

Note that we wrote v G W as v = (v 1 ,v 2 ), where V\ G (A) and v 2 G F (consequently, the 
vector f can be viewed as an (i + m)-tuple over ¥ q , where the first £ symbols in ¥ q describe 
the coordinates of Vi, and the last m symbols describe the coordinates of v 2 ). For a vector 
space V C W we define as before 

V| F = {« 2 GF : («i,« 2 )€F} . 

Let ¥ k [x] denote the set of linearized polynomials f(x) over F of degree at most q k ~ 1 , 
k > 1. Define the mapping £ : F fc [x] -^-V(W,£) as 

£(/(x)) = ((ai,/(ai)),...,(a ft /(a / ))>. 

Using linearized polynomials, one can introduce a code /C (for fc < £) as a collection of 
subspaces of the form 



£ = {£(/(*)) : /W6F fc [x]} . (2) 

The code /C can be easily shown to be an [£ + m, £, m/c, > 2(£ — k + l)] q subspace code 



To formalize the network model, the authors of [13] also introduced the operator channel 
and erasure operator as follows. Let k > be an integer. Given a subspace V C W, 
if dim(U) > k the stochastic erasure operator T-LkiV) returns some random k- dimensional 
subspace of V . Otherwise it returns V itself. Then, for any subspace U in W, it is always 
possible to write U = TikiV) ®E, where %k(y) is a realization of U C\V and E is a subspace 
of W. 

Decoding algorithms for the code /C were presented in [T3] and [2"Uj . Suppose that F G /C 
is transmitted over the operator channel. Suppose also that an (£ — k + 7)-dimensional 
subspace U of W is received, where A; = dim(?7 fl7) = £ — k. Here k = £ — k denotes 
the number of dimension losses when modifying the subspace V to V (1 U, while 7 similarly 
denotes the number of dimension insertions needed to transform V D U into £/. Note that 
dim(.E') = 7, where £? is given in the decomposition of U. The decoders, presented in [T3] 
and [20J, are able to recover a single V & fC whenever K + 7 < £ — k + 1. We denote hereafter a 



decoder for the code K described in [13] and [20] by V^. Note that the decoding complexity 
of T>jc is polynomial both in the dimension of the ambient vector space and the subspace 
distance D. 

We find the following lemma useful in our subsequent derivations. 

Lemma 2.1. Let W = U' © U" be a vector space over ¥ q , and let Vi, V 2 C W be two 
vector subspaces. Then 

D(V 1 ,V 2 )>D(V 1 \ U ,,V 2 \ U ,). 



Proof. By definition, 

D(y 1; V 2 ) = (dim(Vl) - dim(Vi n V 2 )) + (dim(V 2 ) - dim(Vi n V 2 )) . 

Let s = dim(Vi D V 2 ) and t = dim(Vi) — dim(Vi fl V 2 ). Take {v\,v 2) --- ,v s } to be a 
basis of V\ fl V 2 and {ui, u 2 , • • • , u t } to be t linearly independent vectors in V^V^. Then, 
{v\, v 2 , • • • , v s } and {«i, u 2 , • • • , u t } jointly constitute a basis of Vi. 

Next, consider (Vi n V 2 )\u> = ((vi)\u',(v 2 )\u',--- ,(v s )\u'}- Clearly, (VI n V 2 )\ v > is a 
subspace of V\\xji and of V 2 \u'- Therefore, 

{v 1 nv 2 )\ u ,QV x \ v ,nv 2 \ v ,. 

Note that the inclusion in the above relation may be strict. 

On the other hand, B\ together with {(wi)|c/', (""2)1(7', ■ • • , ( u t)\u'} spans Vi\u>. We thus 
have that 

dimiVtlw) - d3m(Vi\ uf nV 2 \u') < dim(Vl| [/ -dim(Bi) < t = dim(Vi) -dim(Vi nV~ 2 ) . (3) 

Similarly to (JSJ) , it can be shown that 

dim^lt/O - dim(Vl|t// n V 2 \ w ) < dim(V 2 ) - dim(Vi n V 2 ) . (4) 

From ([3]) and (jlj), we obtain that 

D(Vi|i;/, V 2 \u>) = (dim(Vl|i/0 - dim(Vi|i// n V^/)) + (dim(V 2 |[/') - dim(Vi|^ n V 2 |cr')) 
< (dim(Vl) - dim(Vi n V 2 )) + (dim(V 2 ) - dim(Vi n V 2 )) 
= D(V 1} V 2 ). 

This completes the proof of the claimed result. □ 



3 Hybrid Coding for Symbol Erasures and Dimension 
Gains/Losses 

3.1 Motivation 

Noncoherent network coding makes the topology of the network transparent to the code 
designer, and it has a strong theoretical foundations. Nevertheless, there are some practical 
issues that remain to be taken into account when applying this coding scheme. First, the 
notion of "dimension loss" is fairly abstract since in networks only symbols (packets) can be 
erased or subjected to errors. It is reasonable to assume that a dimension loss corresponds 
to a number of symbol erasures/errors within the same message, although it is not clear how 
large this number is supposed to be. In the worst case scenario, even one symbol error may 
lead to the change of one dimension. Second, the achievable throughput of the scheme and 
the underlying decoding complexity may be significantly inferior to those achievable only 
through classical network coding. Of course, this claim only holds if the error- correcting 
scheme is integrable with a linear network coding method. 

Let Wc denote the space F™ for some n£N and let £ be a collection of subspaces of Wc 
of dimension £. Assume that V G C is transmitted over a noncoherent network. Assume also 
that p symbol errors and p symbol erasures happened to any of the vectors of V, while they 
were propagating through the network. Denote by U the subspace spanned by the vectors 
obtained at the destination. 

Then, the vectors observed by the receiver are linear combinations of the vectors in V. 
Each of these vectors has, in the worst case scenario, at most p symbol errors and p symbol 
erasures. Indeed, this can be justified as follows. If some vector x was transmitted in the 
network, and an erasure (or error) occurred in its j-th entry, in the worst case scenario 
this erasure (error) can effect only the j'-th coordinates in all vectors in U, causing this 
coordinate to be erased (or altered, respectively) in all of them. This is true for any network 
topology. Such erasure (or error) does not effect any other entries in the vectors observed 
by the receiver. 

This observation motivates the following definitions. 

Definition 3.1. Consider a vector space U C Wc- Write Wc = WsQ(e.j) for some 
1 < J ' < n. A symbol error in coordinate j of U is a mapping from U to U' C Wc, such that 



U ^U' and U\ Ws = U'\ 



w s 



Definition 3.2. Let U C Wc and assume that Wc = WsQ){e.j)- A symbol erasure in 
coordinate j of U is a mapping from U to U' C Ws, such that 

U\ Ws = U' . 



Observe, that symbol errors and erasures can be combined. Thus, we say that the vector 
space U' is obtained from U by p symbol errors and p symbol erasures, if there exist a series 
of p error mappings and p erasure mappings as above from U to U' . Note that under these 
definitions, the order of errors is irrelevant. Thus, if U' is obtained from U by errors in the 
coordinates {ji,J2, • • • ,j P } of U and erasures in the coordinates {j[,j' 2 , • • • ,j' T } of U, then 
any order of application of error and erasure mappings yields the same subspace U' . 

We summarizes the above discussion by mentioning that there are four potential types 
of data errors in a network that are not necessarily incurred independently: 

1. Symbol erasures; 

2. Symbol errors; 

3. Dimension losses; 

4. Dimension gains. 

In the forthcoming sections of this paper, we concentrate on designing codes that are able to 
handle simultaneously symbol erasures, dimension losses and dimension gains. We postpone 
the discussion about how to handle symbol errors to the Appendix. 



3.2 Code Definition 

We start the development with the following definition. 

Definition 3.3. A subspace code C C V(Wc,i) (a collection of subspaces in Wc of 
dimension i.) is called a code correcting d — 1 symbols erasures and D — 1 dimension errors or 
a (d,D) hybrid code if it satisfies the following properties: 

1. For any U G C, dim(C/) = L 

2. For any U,V e C, dim(U) + dim(F) - 2 dim(C/ n V) > 2D. 

3. Let V G C. Let V be the subspace obtained from V by p symbol erasures, where 
p < d — 1. Then, for any possible combination of p symbol erasures with p < d — 1, 
dimfV') = £ and the space V is the only pre-image ofV in C (under p symbol erasures). 

4- Let U,V G C Let U', V be obtained from U and V , respectively, by p symbol erasures, 
where p < d — 1 (here both U and V have erasures in the same set of coordinates) . 
Then, dim(C/') + dim(V') - 2dim(£/ / n V) > 2D. 



Observe that condition (1) is a special case of condition (3), and (2) is a special case of 
condition (4), and therefore the first two conditions can be omitted. We kept these conditions 
for the sake of clarity. 

We explain next why the class of hybrid (d, D) codes, satisfying (l)-(4), are termed codes 
capable of correcting d — 1 symbol erasures and D — 1 dimension errors. 

Theorem 3.1. Let £ C V{Wn, £) be a code satisfying (l)-(4). Then, C is capable of 
correcting any error pattern of d — 1 symbol erasures and D — 1 dimension errors. 

Proof. Suppose that V G C is transmitted through the operator channel, and that the 
subspace U G V((¥ q ) n ~ d+1 , £') is received, where d — 1 symbols erasures and D — l dimension 
errors have occurred. Note that here £' is not necessarily equal to £. 

We assume without loss of generality that the dimension errors occurred first, and are 
followed by symbol erasures. As pointed out before, the order in which errors occur is 
irrelevant. 

More formally, let S = {ji,jz, • • ■ , jd-i} Q [n] be a set of erased coordinates in U, and 
let W c = WsQie^e^, ■ • • , e^). Then, 

U x =Hk{V)@E and U = U 1 \ Ws , 

where dim(V n U{) = k, dim(f/ 1 ) = £', and £ + £' - 2k = D - 1. 

We show that if C satisfies properties (l)-(4), then it is possible to recover V from U. 
Indeed, consider the following collection of subspaces 

£ = {V\w s c (¥ q ) n - d+1 ■ VeC}. 

Take any Vi,V 2 G C. By property (3), dim(Vi) = dim(V2) = £, and by property (4), 
dim(K) + dim(y 2 ) - 2dim(y 1 n V 2 ) = 2D. Therefore, C is a [n - d + l,£,log g |£|,2D] g 
subspace code. It is capable of correcting of up to D — 1 dimension errors in (F g ) n +1 . 



Denote V = V\w s G C. Then, from Lemma [2. l\ 

D(V, U) < D(V, U 1 ) = D-1 . 

We conclude that there exists a (not necessarily efficient) bounded-distance subspace decoder 
that is capable of recovering V from U. 

Finally, observe that V is obtained from V by erasing d—l coordinates indexed by S. 
From property (3), the pre- image of V under these erasures is unique. Therefore, V can be 
recovered from V . □ 

Henceforth, we use the notation [n, £, log (M), 2D, d]Jj to denote the subspace code £ C 
V(W, £) with the following properties: 



1 Whenever it is apparent from the context, we omit the subscript q. 



1. dim(H / ) = n; 

2. for all V eC, dim(F) = t\ 

3. \L\ =M; 

4. £ is a code capable of correcting d — 1 symbols erasures and D — 1 dimension errors. 

Remark 3.1. JTie intuition behind the definition of hybrid codes is that dimension losses 
occur due to symbol erasures or errors. Symbol erasures are "easier" to correct than dimen- 
sion losses, and upon correcting a number of symbol erasures one is expected to reduce the 
number of dimension losses. These claims are more rigorously formulated in Section 5. 3. 

4 Bounds on the Parameters of Hybrid Codes 

In this section, we develop the Singleton and the sphere-packing bound for hybrid codes 
handling dimension losses and gains, and symbol erasures simultaneously. 



4.1 Singleton Bound 

Assume that a vector space W over ¥ q has dimension n, and let C C V(W, £) be a subspace 
code. In what follows, we use a puncturing of the code C, which is similar to symbol erasure 
in all V G £, but has a different assumption on the receiver's knowledge. Specifically, we 
use the following definition. 

Definition 4.1. Puncturing of the code £ at position j is equivalent to the definition of 
erasure at coordinate j in Definition \3.S[ The only difference is that in Definition \3.S\ it is 
assumed that the receiver knows which coordinate was erased, while in this context no such 
knowledge is assumed. 

Theorem 4.1. Let C be a code of type [n, £, log „(M), 2D, d] in the ambient space Wc- 
If d > 1, then coordinate puncturing at coordinate j yields a code with parameters 
M,log ff (M),2D,>d-l]. 



\n 



Proof. Let £ be a code obtained by puncturing of the j-th coordinate in all vectors spaces 
in £. Thus, 

£ = {V : 3V € C and V is V punctured in coordinate j} . 

Clearly, the dimension of the ambient space decreases by one under this puncturing, and 
so the resulting space W satisfies dim(W) = n — 1. 

10 



Let Ve£. Since d > 1, by property (3), dimfV') = £ and V has a unique pre- image. 
Therefore, \C\ = \C\. 

The fourth parameter follows from the property that £ is a code correcting d — 1 symbol 
errors and D — l dimension errors. Thus, dim(C/ / ) + dim(V r/ ) — 2dim(U'r\V) > 2D, where U' 
and V' are obtained by puncturing of U and V, respectively. The value of the last parameter 
follows from the fact that each subspace in C is obtained from its pre-image in C by an 
erasure in the j-th coordinate. □ 

Theorem 4.2. The size M of the [n,£,log q (M),2D,d] q code C satisfies 

M<A q {n-d+l,£,2D) , 

where A q (n, £, 2D) stands for the size of the largest subspace code [n, £, M', 2D] q . 

Proof. We apply d — l coordinate puncturings to C. The resulting code has the same number 
of codewords as C, and it is a set of £ dimensional subspaces in a n — d + 1 dimensional 
space, whose pairwise intersection is of dimension < £ — D. In particular, its size is upper 
bounded by A q (n- d+l,£,2D). D 

Corollary 4.3. From the Singleton bound in /73] /. the size M of the [n, £, log (M), 2D, d] q 
code C satisfies 

' n-d- D + 2 ~ 

e-D + i 



M < 



(5) 



J 9 



We use the following result from [13]. 

Lemma 4.4 (Lemma 4 in [13]). The Gaussian coefficient ["] satisfies 

.r\ In 



1 < q~ 



<4. 



We also use the following definition of the rate of the subspace code. 
Definition 4.2. The rate of the subspace code C is defined as R = — ^ — . 

Next, let 

x £ a D , r d 
A = — , A = — and o — — . 

n £ n 

Thus, an asymptotic version of the latter bound reads as follows. 

Corollary 4.5. The rate of the [n,£, log (|£|), D,d] q code C satisfies 
fl<(l-A-I)(l-*-A + I) + .(l). 



11 



4.2 Sphere-Packing Bound 

Let We be ambient space F™, and let < £ < n. Fix two integers T G [0, n], and £ G 
[0, n]. Two vector spaces U,V € V(Wc,£) are called iT^i)- adjacent if there exists a set of 
coordinates S = {i\, i 2 , ■ ■ ■ , i s } Q [n], s < t, and a vector space Ws such that 

Wc = W s Q{e h ,e l2r -- ,e is ) , 



and 



D(U\ Ws ,V\ Ws )<T 



Note that the adjacency relation is symmetric with respect to the order of U, V, namely U 
and V are (T, t)-adjacent if and only if V and U are (T, t)-adjacent. 

Assume that the code £ is used over the network. Let XJ\ G C be transmitted, and let the 
space V G V(Wc, i) be received as a result of T dimension erasures or gains, and t coordinate 
erasures. Then, U\ and V are (T, t)-adjacent. If there is no other codeword U^ G C such that 
U2 and V^ are (T, t)-adjacent, then the decoder, which is able to correct t coordinate erasures 
and T dimension erasures/gains, can recover U\ from V. This observation motivates the 
following definition. 



Definition 4.3. Let Wc be a vector space F^ ; and let V G V(W C ,£) 
S(V,£,T,t) around V is defined as 

S(V,£,T,t) = {U G V(W £ ,£) : V andU are (T,t)- adjacent} . 



The sphere 



Now, we recall a result from [13] , which we use to provide a lower bound on the number 
of subspaces in the sphere S(V, £, T, t). 

Theorem 4.6 (Theorem 5 in pi]). For any V G V(W C ,£), anyO<T< 21, 



\s{v,e,T, 



T/2 

£• 

i=0 



n-l' 

i 



We generalize this theorem in the following way. 

Theorem 4.7. Let C be a [n,£,\og q \£\,D,d] q code. For any V G C, any < T < 2£ 
and any < t < d, 



t , x T/2 

n 



is ( ^,T,t)i>£r .£ 



s=0 



i=0 



n — s 
i 
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Proof. For a particular set S of cardinality s of punctured coordinates, let V = V\w s - Since 
s < d, dim(y') = £. We also have dim(Ws) = n — s. Thus, 



\S(V',£,T, 



r/2 

IY 

i=0 



n — s 
i 



There are ( n ) ways to choose the punctured coordinates. For each basis vector (of the 
row echelon form) there are q s possibilities to fill in these missing entries. Since there are £ 
such vectors, we obtain 



\ S (v,e,T,t)\>j: r «"■£ 



s=0 



r/2 



i=0 



n — s 
i 



D 



From Theorem \A.7\ the following sphere-packing-type bound is obtained. 

Corollary 4.8. Let C C V(Wc, £) be a code that corrects d — 1 symbol erasures and D — l 
dimension losses/gains. Then 



\C\ < 



\V(W C 



< 



H 



\s(v,£,D-i,d-i)\ - EtJC)^-E^ l)/2 ^ 2 Brn 



(6) 



Now, we turn to an asymptotic analysis of the bound §6§. From Lemma |4.4[ we obtain 



\C\ < 



^D-l ,'2 



d-1 



^D-l 



ES v ls ■ Ef=o ^ a +*c-o-H(»— <-o ^ : j (») g«- . ^ <rc»— <> 



If D — l < (n — s)/2, then the dominating term in E i= ^ g l ( n_s_l ) is obtained when z = D — l. 
Similarly, if £ < n/2, since D — 1 < £, then clearly the dominating term in the denominator 
is obtained when s — d — 1. By putting all these together we obtain that 



\C\ < 



4 q Hn-e) 



4qi(n-e) 

~ q nh 2 ((d-l)/n)\og q 2+£(d-l) . (D-l)(n-d-D+2) 

= 4 :a e ( n - d - e + l )- nh '4( d - l )/ n ) l °Sq 2-(D-l)(n-d-D+2) 

where h 2 (x) = —x\og 2 { x ) — (1 — s)log 2 (l — x) denotes the binary entropy function, and 
f(x) = g(x) means that two expressions f(x) and g(x) are asymptotically equal. 

By taking base-g logarithm and dividing by £n, we obtain the following result. 
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Corollary 4.9. Let C C V(Wc, £) be a code that corrects d — 1 symbol erasures and D — l 
dimension losses/gains. Then, its rate satisfies: 



R< 1 



5-X + 



n 



A 



AA 



l-ho[SX 



log, 2 



•o(l) 



5 Hybrid Code 

Next, we construct a hybrid code, which is capable of correctiing of dimension losses/gains 
and symbol erasures simultaneously. We show that this code is asymptotically optimal with 
respect to the Singleton bound. We also provide some examples discussing hybrid codes and 
subspace codes. 



5.1 Code Construction 



Let W be a vector space (¥ q ) m+ of dimension m + £, and let C be a set of subspaces of W 
of dimension £, such that for any U, V e C, V ^ U, dim([7 fl V) < £ — D. We fix a basis of 
W, and denote its vectors by {w 1; u 2 , ■ ■ • , u m+ i}. We denote the decoder for the subspace 
metric and C by T>£. 

Let G be a (£ + m) x n generator matrix of the [n, £ + m,d] Generalized Reed-Solomon 
(GRS) code C over ¥ q of length n = £ + m + d— 1, given by 

/ 1 1 1 ... 1 \ 



G 



V 



a i 






1 

a 2 



1 

03 



J+m-1 



J+m-1 



l+m-1 



J 



( 



\ 



>h 







m 



o \ 



Vn J 



Here, a t e F g , 1 < i < n denote n distinct nonzero field elements, and rji G F g , 1 < i < n is 
an arbitrary nonzero element (see [T71 Chapter 5] for more details). 

We use the notation G{ for the i-th row of G, for i = 1, 2, • ■ ■ ,£ + m. The code C is 
capable of correcting any error pattern of p errors and fi erasures given that 1p + fi < d — 1. 
In this section, we are particularly interested in the case when p = 0. 

Denote by V RS the decoder for the code C. Consider a field F g n, which can also be viewed 
as a vector space (F g ) n , denoted by Wc- Let A be an (£-\-m) x (£ + m) matrix over F g such 
that 

Vi = 1, 2, • • • , I + m : e^ = ttjA , 



and therefore 



Vz 



1,2,...,. 



+ m : Gj = WjAG . 
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Clearly, such an A exists since {e^} and {u{} are two different bases for ¥ f q +m . 

We define a linear mapping Sc '■ W — >• VFc as follows. For an arbitrary vector v G W, 

£ £ (t>) = vAG . 

This mapping, with a slight abuse of notation, can naturally be extended to the mapping 
Sc '■ V(W) — > V(C), where V(C) stands for a set of all linear sub-codes of C. For any 
V G V(W), we have 

S C (V) = {vAG : v e V} . 

It is easy to see that Sc is a linear mapping, and that the image of the linear space V is 
a linear space. Moreover, it is straightforward to show that this mapping, when applied to 
subspaces of W, is one-to-one. Thus, for any V G W, 

dim(F) = dim(S c (V)) . (7) 

One can check that for any U, V E W, there holds 

dim(U nV) = dim(S c (U) n S C (V)) . (8) 

Next, we define a code C G V(Wc,£) as 

C = {S C (V) : VeC} . 
Theorem 5.1. The code £ is a hybrid code over¥ q , with parameters [n, £, |C|, > 2D, > d]. 

Proof. It is straight-forward to verify the first two parameters of C. The third parameter 
follows from the fact that |C| is the number of subspaces in C, and two different subspaces 
are mapped onto different subspaces under Sc- 

Next, we show that £ is a code capable of correcting d — 1 symbols erasures and D — 1 
dimension gains/losses. It suffices to show the following two properties: 

1. Let V G C. Let V be the subspace obtained from V by any \x symbol erasures, such 
that \x < d — 1. Then, dim(V r/ ) = £ and the space V is the only pre- image of V in L. 

2. Let U, V G C Let U', V be obtained from U and V, respectively, by \x symbol erasures, 
such that \x < d — 1. Here, both U and V have erasures in the same set of coordinates. 
Then, dim(C/ / ) + dim(F') - 2 dim(C/ / n V) > 2D. 

Indeed, these two conditions can be shown as follows. 
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1. Let V (z C and V be obtained from V by fx symbol erasures, such that \x < d — 1. 

Let {«i, v 2 , • • • , i^} be a basis of V, and let {v[, v' 2 , • • • , t;^} be a set of corresponding 

vectors obtained by \x symbol erasures. Then, for any ai,a 2 , ■ ■ ■ ,ae G ¥ q , not all of 

which are zero, 

i 

V = 2_] a i v i G V 
i=l 

is a vector of the Hamming weight > d. Therefore, after applying \x symbol erasures, 
the Hamming weight of the resulting vector 



»' = £ 



a i v i 
i=i 



is at least d — (d — 1) > 1, for any ai, a 2 , • • • , ae G F g , not all Oj's are zero. Therefore, 
the vectors {v^, v 2 , • • • , v' £ } are linearly independent, and thus dim(V') = L 

Next, take a vector v' = ^2 i=1 aiv' { G V. Since the minimum distance of a code C is 
d, and thus the code can correct any pattern of up to d — 1 symbol erasures, the only 
possible pre-image of v' under any fj, symbol erasures, /j, < d— 1, is v — Yli=i a i v i *= ^ ■ 
Therefore, each V G CJ has a unique pre-image. 

2. Let U, V G C and let U', V be obtained from U and V, respectively, by /i symbol 
erasures, such that \x < d — 1. 

From part (1) of the proof, dim(C/') = dim(y) = I. It is sufficient to show that 
dim(f/ / nl / ') < dim(C/ny). Assume, on the contrary, that dim(£/'rW) > dim(£/nV). 
This means that there exists u G C/ and i> G V, u ^ v, such that by applying // 
symbol erasures, the resulting vectors tt' and v' obtained from u and f, respectively, 
are equal. Recall, however, that u, v G C, and therefore d(u,v) > d. We hence arrive 
to a contradiction, and thus dim(fJ') + dim(V") - 2dim(C/' n V") > 2D. 

D 

The following generalization of property 1 above holds. 

Corollary 5.2. Let V be a subcode of C (of any dimension). Let V be obtained from V 
by arbitrary \x symbol erasures, such that \x < d— 1. Then, dim(V') = dim(V) and the space 
V is the only pre-image of V' in V(C). 

The proof is analogous to the proof of property 1. 
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5.2 Asymptotic Optimality 

In Section H] we derived some upper bounds on the size of more general codes. These bounds 
imply upper bounds on the size of the code C. Moreover, the code £ is asymptotically 
optimal with respect to one of these bounds. 

Consider the code £ constructed from the subspace code with parameters [m+£, £, log g |C|, 2D]^ 



and a classical GRS code with parameters [n, m + £, d] 



<i- 



n 



+ m + d — 1, as described in 



the previous section. The resulting code C is a [n, m + £, log g |C|, 2D, d] q code. The number 
of codewords of the code is |C|. If we take |C| as described in [13], then the g-ary logarithm 
of the number of the codewords is given by 

log g |C| =m(£-D + l). 

Therefore, the log of the cardinality of C is given by 

log g \£\ = m(£ - D + 1) = (n - £ - d + l){£ - D + 1) . 

Hence, from Lemma 14.41 



(9) 



n-d-D+2 

e-D + i 



< \c\ < 



n-d-D+2 

e-D + i 



Thus, the code is asymptotically order-optimal (i.e., optimal up to a constant factor) with 
respect to the Singleton bound (jSJ). 



5.3 Examples: Hybrid versus Subspace Codes 

Let K be the code defined in (J2J). When the code K is employed over a noncoherent network, 
in the worst case scenario each symbol erasure translates into the loss of one dimension, 
and each symbol error translates into one dimension loss and one erroneous dimension gain. 
This may happen when all the erasures and errors occur in linearly independent vectors. In 
addition, note that requiring each linearly independent vector to be able to correct up to 
and including d — 1 erasures is somewhat restrictive, since it imposes an individual, rather 
than joint constraint, on the total number of erasures in the transmitted subspace. 

We show next the advantage of using the code C in the case where all data errors in the 
noncoherent network take form of symbol erasures. These symbol erasures are the cause of 
dimension losses. In this case, the code L has more codewords than /C while having the same 
overall error- correcting capability. 

Example 5.1. Consider the code /C with parameters [m + £,£,mk,2{£ — k + 1)] = 
[6, 3, 3, 6] g . This code can correct up to and including two dimension losses. If the symbol 
erasures happen in the linearly independent vectors, the result is a loss of two dimensions, 
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and the code can provably correct such two symbol erasures. (Alternatively, one symbol 
error results in one dimension loss and one dimension gain, which is also correctable by this 
code.) However, this code is not able to correct any combination of three symbol erasures 
that occur in different basis vectors. Note that the code contains q 3 (subspaces) codewords. 

Now, let W = (F 9 ) 4 and consider the set V = V(W, 3), where \V\ = [J . Fix some basis 

{ui}f =1 for V . Let C be [6, 4, 3] q GRS code (for q > 5). Define the mapping £c '■ W — > C as 
in Section 15.11 

The resulting code C has 

.4 _ i 



3 



L J q 



Q 



q 



codewords (subspaces), for all q > 5. It has parameters [6, 3,log g f^zr) > — 2;3] g . Since 
C has a minimum distance 3, C can correct any two symbol erasures. If those appear in 
different basis vectors, the dimension loss error correcting capability matches that of the 
previously described subspace code. But the number of codewords in the code is strictly 
larger than that in /C. 

The increase in the number of codewords achieved through hybrid coding in the above 
scenario is negligible for large field orders. Furthermore, even these modest advantages are 
present only in cases when the symbol erasures (or errors) do not appear in bursts within a 
few linearly independent vectors. 

However, the advantage of the new construction is significantly more pronounced when 
the gap between £ and m is large. This gap becomes of interest when the length of data 
transmitted in the network is much higher than the dimension of the subspaces used in 
coding, or in other words, when the min-cut of the network is small compared to the length 
of the coded vectors. 

Example 5.2. Take the code /C with parameters [m+£, £, mk, 2(£—k+l)] = [12, 4, 16, 6] ? . 
This code can correct up to and including two dimension losses and it contains g 16 codewords. 

For comparison, take W = (IF,?) 10 and consider the set V = 7 7 (W, 4), where \P\ = [ 4 ] . 
Fix some basis {ui}}^ for V. Let C be a [12, 10, 3] q GRS code, with q > 11. Define the 
mapping £ c : W — > C as before. 

The resulting code C has parameters [12, 4, log^ ( [ 4 ] j , > 2, 3] q . Since C has a minimum 
distance 3, £ can correct any two symbol erasures. 

The number of codewords in the code equals 



10' 

4 



{q io . 1){q 9 _ 1){q s _ 1){q 7 _ x) ^ 



This number is strictly larger than 4g 16 (for all q > 11), which is an upper bound on the size 
of any [12, 4, 16, 6} g subspace code [IB"] . 

The examples described above motivate the following question: how many symbol era- 
sures should be counted towards one dimension loss for the case that the subspace and hybrid 
codes have the same number of codewords? 

To arrive at the desired result, we use an upper bound on the size of £, which was derived 
in [13]. Therefore, our findings are also valid for the codes constructed in [131 EH H], as well 
as for any other possible subspace code. 

Let us fix the values of the parameters n and £. Once again, we recall that in the worst 
case, each symbol erasure can cause one dimension loss. We use the Singleton bound on the 
size of const ant- dimension codes of minimum distance 2D [TBI Theorem 9] . Any such code £ 
is capable of correcting D — 1 dimension losses, so in the worst case scenario, it can provably 
correct only up to D — 1 symbol erasures. From [IBJ Theorem 9] we have 



\C\ < 



n-D + 1 

e-b + i 



< Aq 

9 



(e-D+i)(n-e) 



In comparison, the number of codewords in the code constructed in Section 15.11 is given by 

\r\ — n (t-D+l)(n-i-d+l) 

In order to achieve the same erasure-correcting capability, we set D — 1 = (D — 1) + (d — 1). 
The underlying assumption is that D — 1 symbol erasures are corrected as dimensional losses, 
while the remaining erasures are handled as simple erasures. We require that, for small e > 0, 

{£-(£>- l))(n - £) + e < {I - (D - l))(n - £ - (d - 1)) . 

This is equivalent to 

(e-(D-l)-(d- l))(n - £) + e < (£ - (D - l))(n - i - (d - 1)) , 

or 

-(d -l)(n-£) + e<-(£-(D- l))(d - 1) , 

which reduces to 

(n - 2£ + (D - l))(d - 1) >e. (10) 

The latter inequality holds for any choice of D > 2 and d > 2, when n > 2£ + e', for some 
small e' > 0. When the inequality (flOl is satisfied, hybrid codes correct more symbol erasures 
than any constant-dimension subspace code, designed to correct dimension errors only. 

Next, we consider maximizing the number of codewords in L under the constraints that 
[D — 1) + (d — 1) = D — 1, and D > 1, d > 1, where D is fixed and D, d are allowed to vary. 
Recall that 

\og g (\£\) = (£-(D-l))(n-£-(d-l)). 
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Let x = d — 1 so that D — 1 = s — x, where s = D — 1 is a constant. We aim to maximize 
the function 

(£-s + x)(n-£-x) . (11) 

By taking the first derivative of the expression with respect to x and by setting it to zero, we 

find that x max = ^^ — £. Therefore, the value of d that maximizes the number of codewords 

equals 

_ n+ p + l 

"'Opt r, *-• 

If n > 2£, then under the given constraints, the optimal value of d equals d opt = D. 

Consider the expression for the number of codewords in (ITTj) . There are two types of sub- 
space and symbol errors considered: dimension losses and symbol erasures. A combination 
of such errors is termed an error pattern. 

Assume that for a specific code C, correcting a dimension loss is on average equivalent 
to correcting c symbol erasures, for some c > 0. 

If the error pattern consists of no dimension losses and d — 1 symbol erasures, then ffTTl) 
becomes £(n — £ — (d — 1)). In comparison, if the error pattern consists of D — 1 dimension 
losses and no symbol erasures, then (TTT1) becomes (£— (D — l))(n — £). Since each dimension 
loss is on average equivalent to c symbol erasures, we have 

{£ - (d/c - l))(n -£)=£{n-£-{d- 1)) . 

After applying some simple algebra, we obtain 

1 £ 



n — £ V d J d 



! 



and thus c ~ (n — £)/£■ Therefore, vaguely speaking, it is as hard to correct one dimension 
loss as to correct (n — £)/£ symbol erasures. 



6 Decoding 

We proceed to present an efficient decoding procedure which handles symbol erasures, di- 
mension losses and dimension gains. Note that the proposed decoding method may fail in 
the case that symbol errors are also present. This issue is discussed in more details in the 
Appendix. 

As before, assume that V G C is transmitted over a noncoherent network. Assume also 
that U G V(Wc,£'), where £' is not necessarily equal to £, was received. 

Let U' denote the vector space U, where all erased coordinates are deleted. Similarly, 
let C denote the code C where all coordinates erased in U are deleted. We first compute 
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U' = C n U', the intersection of U' with the subspace spanned by the code C. Assume 
that {7i,72, • • • il'i"} are basis vectors of U' (when all erased coordinates are deleted), and 
■y'i G (F g U {?}) n . We apply the erasure- correcting GRS decoder Vrs of the code C on each 
-y[ so as to obtain j t . Let U = (7i,7 2 , • • • ill")- We proceed to apply the inverse of the 
mapping £ c , denoted by £~£ , to U. The resulting subspace V is a subspace of W, on which 
we now run the decoder for the code C. 

The algorithm described above is summarized in Figure [TJ 



Input: U C (F, U {?}) n . 

Let [/' be the space [/, where all erased coordinates are deleted. 

Let C be the code C, where all coordinates erased in U are deleted. 

Let U' = C'n U'. 

Denote U> = (-/[, -y' 2 , ■ ■ ■ , 7^). 

For i = l,2,...,riet7 i =D J j S (7 / i ) . 

Let U = (7i,7 2 >--- ,7/")- 

Let F = El\tJ). 

Let y = P c (V r )- 

Output: Vq. 

Figure 1: Decoder for dimension errors. 

This decoder can correct any combination of G dimension losses and Q dimension gains 
such that G + Q < D — 1, and at most d — 1 symbol erasures. This is stated in the following 
theorem. 

Theorem 6.1. The decoder in Figure [J\ can correct any error pattern of up to d — 1 
symbol erasures and up to D — 1 dimension errors in C. 

Proof. Suppose that V G C is transmitted through the operator channel and that U C 
(F g U {?}) n is received, where /x symbols erasures and G + Q dimension errors have occurred, 
such that fj, < d — 1 and G + Q < D — 1. 

We assume that the dimension errors occurred first, followed by symbol erasures. More 
specifically, let S = {ji, J2 5 • • ■ ,j/j} ^ [n] be the set of erased coordinates in U, and let 
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W c = W s Q(e n ,e h , ■■■ ,e jfl ). Then, 

U t = H k {V) © E and U' = U^w, , 

where dim(V fl Ui) = k, dim(L r i) = £', and (£ — k) + (£' — k) = + Q. Here we assume that 
JJ\ is obtained from V by applying dimension losses/gains only. We also assume that some 
vectors in U contain entries marked with '?', and so U' is obtained by deleting those entries 
from all vectors in U (or, equivalently, from vectors in U{). 

Denote V = V\ Ws C C . Then, by Lemma EU 

D(V',U') < D(V,C/i) <D-1. 

We have dim(V) = dim(V), dim(C/ / ) < dim(C/')- Recall that f/' = C'n £/'. Therefore, 
since V C C, we have 

dim(V n W) = dim((U' n C) n V 7 ) = dim(U' n (C n V 7 )) = dim(C/' n 7 ; ) . 

We consequently obtain 

D(V',U') = dim(V / ) + dim(tr / )-2dini(E7 / nV r/ ) 

< dim(V') + dim(U') - 2 dim(U' n V) 
= D(V',[/') 

< D-l. 



Observe that by Corollary 15.21 it follows that dim(V) = dim(V) and dim([7) = dim ([/'). 
Moreover, U' fl V' can be obtained from U fl V by fi symbol erasures. Then, according to 
Corollary E21 dim(U' n V) = dim(U r\V). We conclude that D(V, U) = D(V, U') < D - 1. 

Finally, due to © and ©, we have that D(£ C \V),£ C \U)) < D - 1. Therefore, the 
decoder D c for the code C is able to recover S^iV), as claimed. □ 

The decoding algorithms in Figure [TJ consists of the following main steps: 

• Computation of the vector space U' = C fl U'. 

Observe that dim(£') = m + £ and dim(C/') < £', where £' < £ + D <2£, since otherwise 
the decoder cannot correct D dimensional errors. Therefore, this computation can be 
done by solving a system of at most n equations over ¥ q with m + £ + £' unknowns. By 
using Gaussian eliminations, this can be done in time 0(n 2 (m + £+£')) < 0(n 2 (£+m)). 

• £" applications of the decoder T>rs(-). 

Note that £" < £' < 2£. This requires 0(£"n log n) < 0(£nlogn) operations over ¥ q . 
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• One application of the mapping V = £^ 1 {U). 

As before, this step is equivalent to multiplying a £" x n matrix representing the basis 
of U by an n x (m + £) transformation matrix representing the mapping £^ (•)• This 
step requires O {£"{£ + m)n) = O (£(£ + m)n) operations over ¥ q . 

• One application of the decoder Vq(-). This takes 0(D(m + £) 3 ) operations over 
¥ q (see pH Chapter 5]). 

By summing up all the quantities we arrive at an expression for the total complexity of the 
presented algorithm of the form 

O (n 2 (£ + m)+£n logn + £(£ + m)n + D(£ + m) 3 ) < O ((£ + m)n 2 + D(£ + m) 3 ) 

operations over ¥ q . 

We note that the most time-consuming step in the decoding process for various choices of 
the parameters is decoding of a constant-dimension subspace code, which requires 0(D{m + 
£) 3 ) operations over ¥ q . However, if the error pattern in a specific network contains a large 
number of symbol erasures, we can design the code such that D is small (say, some small 
constant), thus reducing the complexity of the overall decoder. 
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A Correcting Dimensions and Symbol Errors 

We describe next how to use the code £, defined in Section [5TT1 for correction of error patterns 
that consist of dimension losses, symbol erasures and symbol substitutions. We show that 
the code £ is capable of correcting any error pattern of up to dimension losses, p symbol 
errors and \i symbol erasures, whenever B < D — 1 and 2p + fj, < d — 1. However, we note 
that if in addition to dimension losses one also encounters dimension gains, the decoder for 
the code £ might fail. This issue is elaborated on in the second part of the Appendix. 



A.l Decoding 

Henceforth, we assume that V G £ is transmitted over a noncoherent network and that 
U G V(Wc,£'), where £' is not necessarily equal to £, is received. 
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Suppose that {7i,7 2 , • • • , 7>}> 7j ^ Q^« U {?}) n , are some basis vectors of [/. We apply 
the GRS decoder r> RS for the code C on all these vectors. This decoder produces the vectors 
{/3 1 , /3 2 ,---,/3£/}eC. We denote by U the span of these vectors. Then, we apply the inverse 
of the mapping Sc, denoted by £ c x , to U . The resulting subspace is a subspace of W, on 
which the decoder for the code C is applied. 

The decoding algorithm can be summarized as in Figure [21 



Input: U = ( 7l , 7 2 , ■ • • , 7e<), l t G (¥ g U {?})". 
For i = 1,2, ■■-,£' let fa = V RS (~/ i ) . 
Let U = (P 1 ,0 2 ,---,P t ,). 
Let V = El l {U). 



Let V = V C (V). 
Output: Vq. 



Figure 2: Decoder for symbol errors. 



Analysis of the Decoding Algorithm 

We analyze next the algorithm in Figure [2J The main result of this section is the following 
theorem. 

Theorem A.l. The decoder in Figured can correct any error pattern in L which consists 
of G dimension losses, p symbol errors and jjl symbol erasures, whenever O < D — 1 and 
2p + /j < d- 1. 

Proof. Suppose that V = (vi, v 2 , ■ ■ ■ , Vi) G C is transmitted through an operator channel, 
and that U = ("fi, 7 2 , • • • , 7^), 7« G (¥ q U {?}) n is obtained by the receiver. The assumption 
is that dimension losses, p symbol errors and \x symbol erasures have occurred. 

Let 7j be an arbitrary received vector, 7,- G U. Then, -y i can be obtained from a unique 
vector 7^ G F" by at most p coordinate substitutions and at most p coordinate erasures, 

where 

1 

li = Yl a 3 v i ' 
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and all dj G ¥ q , j = 1, 2, • • • ,£. Since 7^ is a linear combination of vectors in V, it follows 
that 7j G C Therefore, the decoder V R $ is able to recover ■y i from 7^. By using the structure 
of the algorithm, we conclude that (3 { = 7^ and so C/ = (/3 1 , j3 2 , ■ ■ ■ , fle) is a subspace of 1/. 

Since B dimension losses occurred, G < D—l, dim(V A )— dim(f/) < B and D(V, U) < D—l. 
Due to © and (ED, we have that D^ 1 ^),^ 1 ^)) < D - 1. Therefore, the decoder V c is 
able to recover S^iV) from S^^U), as claimed. □ 



Decoding Time Complexity 

The decoding algorithm consists of the following computational steps: 

• £' applications of a Reed-Solomon decoder, for codes of length n = m+£+d—l. 
By using Berlekamp-Massey type decoders, each decoder run can be performed with 
O(nlogn) operations over ¥ q . For this step, we hence have a total complexity of 
O(fnlogn)). 

• One application of the mapping V = E c l (U). 

First, we have to find a basis for U. Gaussian elimination requires 0(£' 2 n) operations 
over ¥ q . The mapping £^ {U) is equivalent to multiplying an £' x n matrix representing 
the basis of U by a n x (m + £) transformation matrix representing the mapping ££ 1 (-). 
The computation of this transformation matrix is done only once in the preprocessing 
step, and so we may assume that this matrix is known. We hence conclude that this 



step takes O (£'(m + £)n + £ n) operations over F, 



<r 



• One application of the decoder £>c(). 

This takes 0(D(m + £) 3 ) operations over ¥ q (see pjQ Chapter 5]). 

The total complexity of the presented algorithm equals 

O (£'nlogn + £'(£ + m)n + £' 2 n + D(m + £) 3 ) < O (Dn 3 + £'n 2 + £' 2 n) 

operations over ¥ q . 

The number of operations depends on the dimension of the received subspace, £'. It would 
hence be desirable to derive an upper bound on £' . However, since each linearly independent 
vector can carry a different pattern of symbol errors, the resulting dimension of U, £', may be 
rather large. However, if we assume that each link to the receiver carries only one vector, £' 
can be bounded from above by the capacity of the cut between the in-degree of the receiver. 
Note that the same issue arises in the context of classical subspace coding, although it was 
not previously addressed in the literature. 
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A. 2 Dimension Insertion and Decoder Failure 

The following example illustrates that the decoder in Figure [2] may fail in the presence of 
both symbol errors and dimension gains. 

Let {ei, e 2 , • • ■ , e 6 } C Fj?, q > 8, be a standard basis of W, let £ = 3, and let C C W be a 
subspace code with 2D = 6. The code C is able to correct up to and including two dimension 
losses and/or gains. Additionally, let {ui,u 2 ,--- ,u & } e F® be a basis of a [8, 6,3] 9 GRS 
code C. The code C is able to correct one symbol error. Assume, without loss of generality, 
that u 5 = (xi,X2, x%, 0, • • • ,0) G C is a codeword of a minimal weight in C. 

Assume that the sender wants to transmit the space Z = (ei,e 2 ,es) to the receiver. 
According to the algorithm, the sender encodes this space as V = £c(Z) = (u±, u 2 , n 3 ), and 
sends the vectors Ui, u 2 , u% through the network. Assume that the vector 1*3 is removed, 
and the erroneous vector z = ii 4 + (xi,0, ••• ,0) is injected instead. At this point, the 
corresponding vector space under transmission is (ui,u 2 ,z). Then, it is plausible that u 1; 
u 2 and z propagate further through the network due to network coding. To this end, assume 
that the receiver receives the following linear combinations, U\ + z and u 2 + z. Assume also 
that during the last transmission, the vector z is subject to a symbol error, resulting in 
z' = u 4 + (xi,x 2 ,0,-- ■ ,0). 

The receiver applies the decoder Vrs on these three vectors, resulting in 



We have that 
and 



^rs{ui + z) = u 1 + u i ; 

T> RS (u 2 + z) = u 2 + U4] 

T^rs{z') = u A + u b . 



U = (ui+ Ui, U 2 + Ua, Ua + u 5 ) 



V = (e 1 + e 4 , e 2 + e 4 , e 4 + e 5 ) . 

Observe that dim(Z fl V) = 1 and that e.\ + e 2 e Z fl V, so that the subspace distance 
between V and V is four. Therefore, the subspace decoder Vc may fail when decoding Z 
from V. This situation is illustrated in Figure [3] 

A. 3 Decoding Strategies for Dimension Gains and Symbol Errors 

To illustrate the difficulty of performing combined symbol and dimension gain error decoding 
of the code C, below we discuss some alternative decoding strategies. We mention why these 
strategies, when applied to the problem at hand, do not work. 
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Figure 3: Situation when the decoder fails. The dimension error vector z should be decoded 
into M4 G C. However, a symbol error changes z into z', which is decoded into u 5 e C. This 
ambiguity increases the dimension of the error space, and causes decoder failure. 

Gaussian eliminations on the orthogonal space. 

Assume that the vector space V is transmitted and U is obtained by a combination of 
symbol and dimension errors, including dimension gains. Then, U can be represented 
as U — 1-Lk(y) © E, where E is some error space. If there were no symbol errors, the 
dimension of E would equal the number of dimension gains, which is small. However, 
if symbol errors are present, E takes a more involved form. 

One can try to represent the space E in a particular basis, for example one in which the 
symbol errors have low weight. Ideally, each symbol error would correspond to a vector 
of weight one in that space. If one could accomplish this task, then one could find all 
the low weight vectors and remove them, or to puncture the corresponding coordinates. 
After such a procedure, one would be left with only dimension gain vectors. 

A particular difficulty is that there are too many different bases for E, and it is not 
immediately clear which basis should be taken. And, while in the right basis the symbol 
error vector will have weight one, in most of the other bases this weight will be large. 
Moreover, the space E can be viewed as a dual code of L. However, then the problem 
of finding low-weight vectors becomes similar to the problem of finding the smallest 
weight codeword in the dual code, which is known to be NP hard. Therefore, it is 
likely that finding the right basis in E is difficult, too. 

Using list-decoding for RS codes. 

One can think about using list-decoding for the code C. Since the known covering 
radius of RS code is d — 1, it may happen that the dimension error will transform the 
codeword into a vector at distance d— 1 from any codeword. Then, even a single symbol 
error can move this codeword to a different ball of radius d — 1 around a codeword, 
similarly to the situation depicted in Figure El Since list-decoding can correct only less 
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than d errors, list-decoding cannot recover the original codeword. 

The second problem associated with list decoding is as follows. Even if one could 
construct a polynomial- size list of all possible codewords before the dimension error 
took place, there would be a different list for each received vector. Since there could 
be as many as £ different lists, an exhaustive approach to picking the right codewords 
from all the lists would require time exponential in £. 



B Conclusion 

We introduced a new class of subspace codes capable of correcting both dimension errors 
and symbol errors, termed hybrid codes. For these codes, we derived upper bounds on the 
size of the codes and presented an asymptotically constant-optimal concatenated code design 
method. We presented polynomial-time decoding algorithms which are capable of correcting 
the following error patterns: 

• Dimension losses/gains and symbol erasures; 

• Dimension losses and symbol erasures/errors. 

We also discussed correction of error patterns that consist of all four types of errors: di- 
mension losses/gains and symbol erasures/errors. As we illustrated by the example, the 
corresponding task is difficult, and is left as an open problem. 



References 

[1] R. Ahlswede, N. Cai, S.Y.R. Li, and RW. Yeung, "Network information flow," IEEE 
Trans. On Inform. Theory, vol. 46, pp. 1204-1216, July 2000. 

[2] N. Cai, R. W. Yeung, "Network coding and error correction," Proc. IEEE Inform. 
Theory Workshop (ITW), Bangalore, India, pp. 119-122, Oct. 2002. 

[3] R. Dougherty, C. Freiling, and K. Zeger, "Insufficiency of linear coding in network 
information flow," IEEE Trans. On Inform. Theory, vol. 51, no. 8, pp. 2745-2759, 
August 2005. 

[4] T. Etzion, N. Silberstein, "Error-correcting codes in projective spaces via rank-metric 
codes and Ferrers diagrams," IEEE Trans. On Inform. Theory, vol. 55, pp. 2909-2919, 
July 2009. 

[5] T. Etzion, A. Vardy, "Error-correcting codes in projective space," IEEE Trans. On 
Inform. Theory, vol. 57, pp. 1165-1173, Feb. 2011. 

28 



[6] E. M. Gabidulin, "Theory of codes with maximal rank distance," Problems of Informa- 
tion Transmission, vol. 21, pp. 1-12, July 1985. 

[7] E.M. Gabidulin, M. Bossert, "Codes for network coding," Proc. IEEE Intern. Sympo- 
sium on Inform. Theory (ISIT), Toronto, Canada, July 2008. 

[8] M. Gadouleau, Z. Yan, "Constant-rank codes and their connection to constant- 
dimension codes," IEEE Trans. On Inform. Theory, vol. 56, no. 7, pp. 3207-3216, 
July 2010. 

[9] T. Ho, R. Kotter, M. Medard, D. R. Karger, and M. Effros, "The Benefits of Coding 
over Routing in a Randomized Setting," Proc. IEEE Intern. Symposium on Inform. 
Theory (ISIT), Yokohama, Japan, June- July 2003. 

[10] M. Jafari Siavoshani, S. Mohajer, C. Fragouli, and S. Diggavi, "On the capacity of non- 
coherent network coding," IEEE Trans. On Inform. Theory, vol. 57, no. 2, pp. 1046- 
1066, Feb. 2011. 

[11] A. Khaleghi, D. Silva, and F. R. Kschischang, "Subspace codes," Lecture Notes In 
Computer Science, vol. 5921, pp. 1—21, 2009. 

[12] A. Kohnert, S. Kurz, " Construction of large constant dimension codes with a prescribed 
minimum distance," Lecture Notes In Computer Science, vol. 5393, pp. 31-42, 2008. 

[13] R. Kotter, F.R. Kschischang, "Coding for errors and erasures in random network cod- 
ing," IEEE Trans. On Inform. Theory, vol. 54, pp. 3579-3591, Aug. 2008. 

[14] R. Kotter, M. Medard, "An algebraic approach to network coding," IEEE/ACM Trans- 
actions on Networking , vol. 11, no. 5, pp. 782-795, October 2003. 

[15] S. Li, R. Yeung, and N. Cai, "Linear network coding," IEEE Trans. On Inform. Theory, 
vol. 49, pp. 371-381, 2003. 

[16] F. Manganiello, E. Gorla, and J. Rosenthal, "Spread codes and spread decoding in 
network coding," Proc. IEEE Intern. Symposium on Inform. Theory (ISIT), Toronto, 
Canada, July 2008. 

[17] R. M. Roth, Introduction to Coding Theory, Cambridge, UK: Cambridge University 
Press, 2006. 

[18] R. M. Roth, "Maximum-rank array codes and their application to crisscross error cor- 
rection," IEEE Trans. Inform. Theory, vol. 37, pp. 328-336, March 1991. 

[19] N. Silberstein, Coding Theory in Projective Space, Ph.D. Research Proposal, Technion, 



Haifa, Israel, May 2008, available online at http://arxiv.org/abs/0805.3528 



29 



[20] D. Silva, F. R. Kschischang, and R. Kotter, "A rank-metric approach to error-control 
in random network coding," IEEE Trans, on Inform. Theory, vol. 54, pp. 3951-3967, 
Sept. 2008. 

[21] V. Skachek, "Recursive code construction for random networks," IEEE Trans, on In- 
form. Theory, vol. 56, pp. 1378-1382, March 2010. 

[22] A.-L. Trautmann, J. Rosenthal, "New improvements on the echelon-Ferrers construc- 
tion," Proc. 19th Intern. Symposium on Math. Theory of Networks and Systems 
(MTNS), pp. 405-408, Budapest, Hungary, 2010. 

[23] S.T. Xia, F.W. Fu, "Johnson type bounds on constant dimension codes," Designs, Codes 
and Cryptography, vol. 50, pp. 163-172, Feb. 2009. 

[24] J.H. van Lint, R.M. Wilson, A Course in Combinatorics, Cambridge, UK: Cambridge 
University Press, second ed., 2001. 

[25] H. Wang, C. Xing, and R. Safavi-Naini, "Linear authentication codes: bounds and 
constructions," IEEE Trans. On Inform. Theory, vol. 49, pp. 866-873, Apr. 2003. 



30 



